Crop layer for automatically aligning computations #1976
Conversation
This will be useful for keeping track of the coordinate transformations induced by, e.g., convolution and pooling layers.
This allows layers to do things that depend, e.g., on net topology.
Crop layer for automatically aligning computations

Merge 'shelhamer/crop-layer':
- add CropLayer for cropping one blob to another using induced coordinates
- layers get a pointer back to their owning Net
- implement coord_map for all applicable layers
- add FilterMap for the coord mapping used by (de)conv and pooling layers
- add util/coords.hpp for coordinate mapping functions
This is a great thing. I have one suggestion about specifying the operation. The actual data of one of the bottom blobs is not needed; you only need its shape and the coordinate mapping so you can perform the crop. Instead of specifying the "crop like this" blob as a bottom blob, it could instead be referenced by name in another field in a new CropLayerParameter protobuf message. This way, we avoid introducing a split layer (and the additional data copying) that would occur with making it a bottom blob. This reduces GPU memory usage and could allow larger networks to use this.
For those watching, this is due for an update with a less intrusive version (like the "another way" above) that takes advantage of net spec, after which I think it'll be ready for merge. @waldol1, yes, that's a good point about the extra allocation. Unfortunately you can't really specify a layer name in a parameter (without some extra mechanism), since that breaks the layer abstraction: a layer has no way to access other layers except through its top and bottom links (short of the backpointer to its net, which is present here but removed in the "less intrusive version" above). Let's think more about the right way to address that!
@longjon: Does the less intrusive version already exist in some form? I'm willing to pitch in either way to make this ready for merge.
Crop layer for automatically aligning computations (conflicts: include/caffe/common_layers.hpp, include/caffe/layer.hpp, include/caffe/neuron_layers.hpp, include/caffe/vision_layers.hpp, src/caffe/net.cpp)
@longjon Do you know when the netspec version of this PR can be committed? In the meantime I did a naive rebase of this PR here, which segfaults for some reason, 😬, probably due to the now-different sharing of layers from root. I didn't look at this too closely, but for the less intrusive version I guess the idea is for the crop layer to have access to the other layers via the net's layer-sharing functionality (what's the right name for this?). Then the only hurdle would be that the variables that determine the transformation are stored in different formats (kernel_shape_ as a Blob in conv and deconv, vs. kernel_h_, kernel_w_ in pool). This would mean (a) storing all transformations in the same format, or (b) special code in the crop layer to know where it should look. --This is pure conjecture. BR, Max
Sorry for the wait, folks; I hope to have an update on this next week. You may want to take a look at my most recently rebased version (still fairly out-of-date, but less than this PR) at https://github.com/longjon/caffe/tree/future. @BlGene: not really, the new way I prefer to do this is:
This moves the magic from inside a layer to outside the net, preserving the layers of abstraction without hindering future functionality; as a bonus it makes the crop values discoverable to the user, and can be adapted for other features that rely on computing the coordinate maps. More details to come!
Yes, that would be even less invasive. I updated my rebase so that the crop layer now works this way. The layer only works for 4D blobs; one could in theory extend it. The next step would be to write a demo python script that calculates appropriate crop offset parameters for the FCN net, for example. BR, Max
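Such a demo script might look like the following sketch. Everything here is hypothetical (the helper names and layer-tuple format are made up for illustration, using the usual receptive-field arithmetic rather than Caffe's actual coord_map code): each layer is summarized as (kernel, stride, pad), the affine coordinate maps of two paths are composed, and the crop offset is the integer translation between them.

```python
def path_map(layers):
    # Compose per-layer affine maps (scale, shift); the result takes an
    # output index of the final layer to the corresponding input coordinate
    # of the first layer. Each layer is a (kernel, stride, pad) tuple.
    a, b = 1, 0.0
    for kernel, stride, pad in layers:
        a, b = a * stride, a * ((kernel - 1) / 2.0 - pad) + b
    return a, b

def crop_offset(crop_path, ref_path):
    # Offset at which to crop the blob produced by crop_path so that its
    # coordinates line up with the blob produced by ref_path.
    a0, b0 = path_map(crop_path)
    a1, b1 = path_map(ref_path)
    assert a0 == a1, "coordinate scales differ; crop cannot align these blobs"
    off = (b1 - b0) / a0
    assert off >= 0 and off == int(off), "mapping is not an integer crop"
    return int(off)

# A padded 3x3 conv preserves coordinates; an unpadded 5x5 conv shifts
# them by 2, so the padded blob must be cropped starting at offset 2.
print(crop_offset([(3, 1, 1)], [(5, 1, 0)]))  # -> 2
```

With such a script the offsets could be filled into a plain crop parameter at net-specification time, which is exactly the "outside the net" approach @longjon describes above.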
Replaced by N-D crop in #3570.
This is the master edition of #1639 -- thanks to a rebase by @philkr. After #1974 and #1975.

Existing layers shift and warp coordinate space: translation by padding (or lack thereof), contraction by strided convolution or pooling, and expansion by strided deconvolution (#1615). Often one wants to align two blobs, e.g., to establish a correspondence between input and output, or to fuse two different paths of computation. Counting conv/deconv strides to ensure that blob coordinates have the same scale is generally straightforward; computing the offset between two blobs that results from intermediate padding and kernel sizes is trickier.
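As a rough sketch (not Caffe's internal API; the function names here are invented, and the formula is the standard receptive-field geometry), the coordinate transformation induced by a conv or pooling layer can be written as an affine map from top coordinates back to bottom coordinates, and maps of stacked layers compose:

```python
# Hypothetical sketch: the affine coordinate map induced by a conv/pool
# layer, taking a top (output) index to the bottom (input) coordinate at
# the center of its receptive field:
#   x_bottom = stride * x_top + (kernel - 1) / 2 - pad
def filter_map(kernel, stride=1, pad=0):
    return (stride, (kernel - 1) / 2.0 - pad)

def compose(outer, inner):
    # Apply `inner` (a later layer's map) then `outer`, yielding the map
    # from the later layer's top all the way back to `outer`'s bottom.
    a1, b1 = outer
    a2, b2 = inner
    return (a1 * a2, a1 * b2 + b1)

conv = filter_map(kernel=3, stride=1, pad=1)  # (1, 0.0): coordinate-preserving
pool = filter_map(kernel=2, stride=2, pad=0)  # (2, 0.5): contracts by 2
print(compose(conv, pool))                    # conv followed by pool -> (2, 0.5)
```

The scale component tracks the stride counting mentioned above, while the shift component is exactly the offset from padding and kernel sizes that is trickier to keep straight by hand.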
This layer takes two bottom blobs and produces one top, which is a copy of the first bottom cropped to the size of the second so that coordinates exactly correspond, i.e., it makes sense to fuse or compare the top blob with the second bottom, regardless of whatever padding or other shenanigans took place between their computation.
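In numpy terms, the forward pass amounts to an offset spatial copy. A minimal sketch, assuming 4D N x C x H x W blobs and already-computed offsets (the function name and the offsets here are illustrative, not taken from the layer):

```python
import numpy as np

def crop_like(bottom0, bottom1, offset_h, offset_w):
    # Copy bottom0, cropped to bottom1's spatial extent at the given offsets.
    h, w = bottom1.shape[2], bottom1.shape[3]
    return bottom0[:, :, offset_h:offset_h + h, offset_w:offset_w + w].copy()

a = np.arange(2 * 3 * 6 * 6, dtype=np.float32).reshape(2, 3, 6, 6)
b = np.zeros((2, 3, 4, 4), dtype=np.float32)
top = crop_like(a, b, 1, 1)
print(top.shape)  # (2, 3, 4, 4): the first bottom cropped to the second
```

What the layer adds over this one-liner is computing the offsets automatically from the net, as described next.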
This is done by computing the coordinate mapping between the two bottom blobs, as provided by #1637 and made accessible by #1638. If that mapping is a simple translation, and has the right sign to allow the first blob to be "cropped to" the second, the layer simply performs the copy. If the mapping is not an integer translation, or the translation has the wrong sign, an error is thrown, and the net may be rearranged to allow sensible fusion.
The implementation of `LayerSetUp` amounts to some simple graph traversal to find the path connecting the two bottoms. Currently `Net` does not provide great facilities for traversing the layer graph, so it's a bit cumbersome; maybe this can be improved in the future.

There is a bit of engineering involved in these three PRs, but the result is pretty convenient: what was before a tricky offline calculation becomes a trivial layer specification.
Another way to implement this, without #1974, would be to remove the graph traversal from `CropLayer`, giving it a simple parameter instead, and provide some other mechanism for automatically filling in that parameter.

Currently CPU and GPU implementations (trivially) are provided, but tests and documentation are not.